Rank in Wordlist | Frequency | Word |
---|---|---|
6595 | 21 | 1,000 |
9690 | 12 | 1,050 |
9691 | 12 | 1,200 |
11733 | 9 | 15,000 |
12715 | 8 | 1,500 |
12765 | 8 | 20,000 |
12778 | 8 | 30,000 |
13889 | 7 | 1,100 |
13940 | 7 | 2,000 |
15472 | 6 | 2,049 |
Rank in Wordlist | Frequency | Word |
---|---|---|
10711 | 11 | مانٹریال(موں |
17331 | 6 | یعقوب(ع |
17796 | 5 | f(x |
21012 | 4 | آپ(ع |
26155 | 3 | آپ(ص |
34800 | 2 | M(t |
35157 | 2 | Q(0 |
37860 | 2 | ایتھنز(آتھینا |
38636 | 2 | بٹ(1000 |
42509 | 2 | صادق(ع |
Rank in Wordlist | Frequency | Word |
---|---|---|
5805 | 26 | گاؤں)، |
6591 | 22 | ہے)۔ |
9099 | 14 | ٹاؤن)، |
9146 | 14 | کمیونٹی)، |
10243 | 12 | ہیں)۔ |
17323 | 6 | ہے)، |
19678 | 5 | پی)، |
20286 | 4 | 2001ء)، |
21067 | 4 | ادبیات)، |
21771 | 4 | تھا)۔ |
Rank in Wordlist | Frequency | Word |
---|---|---|
31572 | 2 | %1 |
31573 | 2 | %60 |
31574 | 2 | %85 |
31575 | 2 | %90 |
32834 | 2 | 25% |
33343 | 2 | 50% |
33502 | 2 | 70% |
48920 | 1 | %0.0156۔ |
48921 | 1 | %10 |
48922 | 1 | %100)سرمایہ |
Rank in Wordlist | Frequency | Word |
---|---|---|
33722 | 2 | AT&T |
61869 | 1 | D.%20Course%20work%20%20Syllabus%20P-II,%20III%20&%20IV.pdf |
Rank in Wordlist | Frequency | Word |
---|---|---|
48917 | 1 | $200,000 |
48918 | 1 | $AU |
48919 | 1 | $، |
59351 | 1 | A$ |
81585 | 1 | امریکی$47 |
117047 | 1 | ٤٥٠$ |
Rank in Wordlist | Frequency | Word |
---|---|---|
5348 | 28 | People's |
15644 | 6 | L'Islet |
20595 | 4 | Côte-d'Or |
25416 | 3 | George's |
25634 | 3 | O'Higgins |
25674 | 3 | Pont-l'Évêque |
25729 | 3 | Saint-André-de-l'Eure |
25731 | 3 | Saint-Aubin-d'Aubigné |
31449 | 3 | ہے'۔ |
33757 | 2 | Aix-d'Angillon |
Rank in Wordlist | Frequency | Word |
---|---|---|
9585 | 13 | ٹوئنٹی/20 |
16983 | 6 | ١٧/١٤ |
17799 | 5 | http://ecp |
20107 | 4 | 1/2 |
25350 | 3 | D/H |
25967 | 3 | http://hamzaurduarchive |
25968 | 3 | https://www |
26645 | 3 | اور/ |
30314 | 3 | ٹی/20 |
31844 | 2 | 1/14 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots